AITopics | black-box backdoor defense

Black-box Backdoor Defense via Zero-shot Image Purification

Neural Information Processing Systems | Dec-26-2025, 14:13:27 GMT

Backdoor attacks inject poisoned samples into the training data, resulting in the misclassification of the poisoned input during a model's deployment. Defending against such attacks is challenging, especially for real-world black-box models where only query access is permitted. In this paper, we propose a novel defense framework against backdoor attacks through Zero-shot Image Purification (ZIP). Our framework can be applied to poisoned models without requiring internal information about the model or any prior knowledge of the clean/poisoned samples. Our defense framework involves two steps.
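The query-only setting described above can be illustrated with a small sketch. The listing cuts off before naming the two steps of ZIP, so the code below does not attempt to reproduce them. It only shows the general shape of a black-box defense: the input is purified before it reaches a model that can be queried but not inspected. Every name here (`median_filter`, `defended_query`, `toy_backdoored_model`) is hypothetical, the 3x3 median filter is a toy stand-in purifier, and the single-pixel trigger is an assumption chosen for illustration.

```python
from statistics import median
from typing import Callable, List

Image = List[List[float]]


def median_filter(img: Image) -> Image:
    """3x3 median filter with edge clamping (toy stand-in purifier, not ZIP)."""
    h, w = len(img), len(img[0])
    out = []
    for r in range(h):
        row = []
        for c in range(w):
            window = [
                img[min(max(r + dr, 0), h - 1)][min(max(c + dc, 0), w - 1)]
                for dr in (-1, 0, 1)
                for dc in (-1, 0, 1)
            ]
            row.append(median(window))
        out.append(row)
    return out


def defended_query(
    query_model: Callable[[Image], str],
    purify: Callable[[Image], Image],
    img: Image,
) -> str:
    """Purify the input, then query the model. Only query access is used."""
    return query_model(purify(img))


def toy_backdoored_model(img: Image) -> str:
    """Hypothetical black box: a saturated pixel at (1, 1) acts as the trigger."""
    return "target" if img[1][1] >= 1.0 else "clean"


# A uniform 4x4 toy image, and a poisoned copy carrying the assumed trigger.
clean = [[0.1] * 4 for _ in range(4)]
poisoned = [row[:] for row in clean]
poisoned[1][1] = 1.0

print(toy_backdoored_model(poisoned))
print(defended_query(toy_backdoored_model, median_filter, poisoned))
```

The defense never touches model weights or training data. It relies only on the purifier removing the trigger while leaving enough of the image intact for the model to classify it.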

black-box backdoor defense, name change, zero-shot image purification, (5 more...)

Industry: Transportation > Air (0.69)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.56)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.33)

SampDetox: Black-box Backdoor Defense via Perturbation-based Sample Detoxification

Neural Information Processing Systems | May-27-2025, 19:04:40 GMT

The advancement of Machine Learning has enabled the widespread deployment of Machine Learning as a Service (MLaaS) applications. However, the untrustworthy nature of third-party ML services poses backdoor threats. Existing defenses in MLaaS are limited by their reliance on training samples or white-box model analysis, highlighting the need for a black-box backdoor purification method. In our paper, we attempt to use diffusion models for purification: a forward diffusion process introduces noise to destroy backdoors, and a reverse generative process recovers clean samples. However, since higher noise levels also destroy the semantics of the original samples, restoration performance remains low.
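The trade-off in the last sentence can be made concrete with the standard DDPM forward process, x_t = sqrt(ᾱ_t) x_0 + sqrt(1 - ᾱ_t) ε, where ᾱ_t is the cumulative product of (1 - β_s). The sketch below is a minimal illustration, not SampDetox itself. It assumes a linear β schedule from 1e-4 to 0.02 over 1000 steps (a common default, not stated in the abstract), and the function names and the four-value "image" are hypothetical.

```python
import math
import random


def alpha_bar(t, num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Cumulative product of (1 - beta_s) for s = 1..t (assumed linear schedule)."""
    prod = 1.0
    for s in range(1, t + 1):
        beta = beta_start + (beta_end - beta_start) * (s - 1) / (num_steps - 1)
        prod *= 1.0 - beta
    return prod


def forward_diffuse(x0, t, rng):
    """Sample x_t = sqrt(a_bar) * x0 + sqrt(1 - a_bar) * eps, with eps ~ N(0, 1)."""
    a_bar = alpha_bar(t)
    scale, sigma = math.sqrt(a_bar), math.sqrt(1.0 - a_bar)
    return [scale * v + sigma * rng.gauss(0.0, 1.0) for v in x0]


def signal_to_noise(t):
    """Ratio of signal power to noise power at step t (requires t >= 1)."""
    a_bar = alpha_bar(t)
    return a_bar / (1.0 - a_bar)


rng = random.Random(0)
image = [0.2, 0.8, -0.5, 1.0]  # toy stand-in for flattened pixel values

for t in (10, 100, 500):
    noisy = forward_diffuse(image, t, rng)
    print(t, round(signal_to_noise(t), 3), [round(v, 2) for v in noisy])
```

The signal-to-noise ratio falls monotonically with the step t. A step large enough to wash out a trigger therefore also removes much of the content the reverse process would need to restore the original sample.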

black-box backdoor defense, perturbation-based sample detoxification, sampdetox, (6 more...)

Industry: Transportation > Air (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Black-box Backdoor Defense via Zero-shot Image Purification

Neural Information Processing Systems | Jan-19-2025, 19:57:35 GMT

black-box backdoor defense, black-box model, zero-shot image purification, (3 more...)

Industry: Transportation > Air (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.68)
Information Technology > Artificial Intelligence > Machine Learning (0.60)
